Navigating Through Temporal Difference

نویسنده

  • Peter Dayan
چکیده

Barto, Sutton and Watkins [2] introduced a grid task as a didactic example of temporal difference planning and asynchronous dynamical pre>gramming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Memory-Based Learning Schemes for Robot Navigation in Discrete Grid-Worlds with Partial Observability

Abstract In this paper we tackle the problem of robot navigation in discrete grid-worlds using memory-based learning schemes. Different memory-based approaches are tested for navigating an agent across a discrete but partially observable world, and the significance of memory structure is examined. Further, the effects of additional memory hierarchies and multi-level learning frameworks are anal...

متن کامل

Control of Multivariable Systems Based on Emotional Temporal Difference Learning Controller

One of the most important issues that we face in controlling delayed systems and non-minimum phase systems is to fulfill objective orientations simultaneously and in the best way possible. In this paper proposing a new method, an objective orientation is presented for controlling multi-objective systems. The principles of this method is based an emotional temporal difference learning, and has a...

متن کامل

Navigating Multimodal Meeting Recordings with the Meeting Miner

We present Meeting Miner, a multimodal meeting browser for navigating recordings of online text and speech collaborative meetings. Meetings are recorded through a collaborative writing environment specially designed to capture participants activities. This information, usually lost in common recordings of multimodal meetings, offers novel possibilities for indexing, navigation and information r...

متن کامل

Politics and Power in Global Health: The Constituting Role of Conflicts; Comment on “Navigating Between Stealth Advocacy and Unconscious Dogmatism: The Challenge of Researching the Norms, Politics and Power of Global Health”

In a recent article, Gorik Ooms has drawn attention to the normative underpinnings of the politics of global health. We claim that Ooms is indirectly submitting to a liberal conception of politics by framing the politics of global health as a question of individual morality. Drawing on the theoretical works of Chantal Mouffe, we introduce a conflictual concept of the political as an alternative...

متن کامل

Navigating Through Multiple Temporal Granularity Objects

Managing and relating temporal information at different time units is an important issue in many applications and research areas, among them temporal object-oriented databases. Due to the semantic richness of the objectoriented data model, the introduction of multiple temporal granularities in such a model poses several interesting issues. In particular, object-oriented query languages provide ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990